You've learnt about all the core aspects of data exploration, data cleaning, and data visualisation. Download, unzip and open the notebook I've included for this assignment.

You'll find an incredibly rich dataset from nextspaceflight.com that includes all the space missions since the beginning of Space Race between the USA and the Soviet Union in 1957! It has data on the mission status (success/failure), the cost of the mission, the number of launches per country, and much much more. There's so much we can learn from this dataset about the dominant organisations and the trends over time. For example:

  • Who launched the most missions in any given year?

  • How has the cost of a space mission varied over time?

  • Which months are the most popular for launches?

  • Have space missions gotten safer or has the chance of failure remained unchanged?

I'm sure that you'll discover many more questions that you can formulate and answer with this dataset! Use it to practice what you learnt about creating various types of charts and visualisations, from choropleths to sunburst charts to segmented bar charts and see if you can turn data into insight. Good luck!


Questions for this assignment

Reflection Time:

This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.

Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?

Download resource files

Chittyala Sanjeeth
Posted 26 days ago

Reflection Time:

This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.

Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?

sklsbjso

Give Feedback

What went well? What could be improved?

Vanshika S Deshpande
Posted 27 days ago

Reflection Time:

This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.

Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?

import pandas as pd

import matplotlib.pyplot as plt


# Load the space mission data from a CSV file

data = pd.read_csv('space_missions.csv')


# Convert the date column to datetime format

data['Date'] = pd.to_datetime(data['Date'])


# Extract year from the date column

data['Year'] = data['Date'].dt.year


# Group the data by year and count the number of missions

missions_per_year = data.groupby('Year').size()


# Plot the number of missions per year

plt.figure(figsize=(10, 6))

missions_per_year.plot(kind='line', marker='o')

plt.title('Number of Space Missions per Year')

plt.xlabel('Year')

plt.ylabel('Number of Missions')

plt.grid(True)

plt.show()


Give Feedback

What went well? What could be improved?

Jonathan Ross Balsicas
Posted 12 hours ago

Reflection Time:

This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.

Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?

# with this exercise , I created a template for charting, its a bit long cut process but it teaches me and let me memorize the methods by repition:

name_frequency = clean_data['Organisation'].value_counts().reset_index()
name_frequency.columns = ['name', 'frequency']
name_frequency.shape
name_frequency.head

plt.figure(figsize=(14,8))
plt.xticks(fontsize=14, rotation=90)
plt.yticks(fontsize=14)
plt.xlabel('Organisation', fontsize=14)
plt.ylabel('Number of Launch', fontsize=14)

plt.title('Number of Launches per Company', fontsize=24)
plt.bar(name_frequency.name[:24], name_frequency.frequency[:24])

# I also learned how to use plotly and matplotlib, and differenciate their pros and cons when using it

Give Feedback

What went well? What could be improved?